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Traditionally standardized tests of achievement were used to report to 
parents and to monitor state and district trends. Such tests had almost no effect on 
instruction because teachers paid so little attention to them. According to a national 
survey conducted by Goslin in 1967, teachers only inf eguently used the results of 
standardized tests and reported virtually no influence of test content on teaching 
methods or course content. This picture changed, however, beginning with the 
minimum competency testing movement in the 1970s and educational reforms in the 
1980s. It was the explicit intention of reformers in recent decades to change 
instruction by imposing tests. 

The debate about the positive or negative effects of testing on instruction is 
^ now a familiar one. Proponents of test-based reforms saw tests as a means for 

^ external agencies, such as the state or district, to set standards and assure their 

attainment. This position was best exemplified by Popham's (1987) advocacy of 
measurement-driven instruction. Popham argued that if tests measure important 
skills and have sufficiently high stakes, they will serve as "instructional magnets" thus 
dramatically improving the efficiency and effectiveness of instruction. Opponents of 
high-stakes, externally-mandated tests protested, however, that such tests would have 
a deleterious effect on the quality of education by narrowing the focus of instruction 
to only tested content and by encouraging presentation of content in fragmented bits 
(Bracey, 1987). 

Evidence to support the positive claims for measurement-driven instruction 
comes primarily from nigh-stakes tests themselves. For example, Popham, Cruse, 
Rankin, Sandifer, and Williams (1985) and Popham (1987) pointed to the steeply 
rising passing rates on minimum competency tests as demonstrations that MDI had 
improved student learning. In South Carolina, for instance, the percentage of first 
graders passing the state's basic skills assessment in reading increased from 70% in 
1981 to 80% in 1984. In first grade mathematics the passing rate increased from 68% 
to 81% (Popham et al ., 1985). Similar gains were reported m other grades levels and 
in other states. Popham et al . (1985) also observed instructional changes being made 
in response to high-stakes tests, such as the use of test specifications as instructional 
guides, and noted that the greatest gains in achievement occurred where there had 
been the greatest efforts to direct instruction toward the test. 

It is possible, however, for test scores to go up without there being a 
commensurate gain in learning. The possibility that high-stakes testing programs can 
produce inflated or spurious re6ults was suggested, for example, by the Cannell's 
(1987) finding that all 50 states claimed to be above average. Independent evidence 
of achievement trends from the National Assessment provides both good news and 

, Paper presented at the annual meeting of the American Educational Research 
Association and the National Council on Measurement in Education, Chicago, April 
1991. 
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bad news for proponents of measurement-driven instruction. Student performance 
has improved; according to the NAEP, in basic reading and math skdls dunng the 
period of high-stakes testing. At the same time, however, there has been no cam or a 
decrease in higher-order, advanced skills. Many attribute this pattern of results to 
the negative influence of standardized testing on teachmg and learmng. 

Evidence documentinji the negatwe influenoe of testing on ir«truction is 
limited to a few studies. Darfing-Hammond and Wise (1985) reported that teachers 
in their study were pressured to "teach the test." Teachers gave specific examples, 
such as being asked to write materials similar to the test to give their students 
practice, and givinB up on essay tests so that classroom experiences would more 
closely paraUelstandardizcd tests. State and district testing durcctors mteiviewed by 
SheparS (1990a, 199f/b) acknowledged that teachers generally "spend more time 
teaching the specific objectives on the testfs) than they would if the tests were not 
required." Testing directon were divided, however, as to whether they saw the 
influence of tests as positive or negative. Some bebeved that focusmg instruction was 
beneficial because it ensured that essential skills were taught. Others resetted that 
higher-order thinking skills and subjects like science and social studies suffered 
beca\ise of the emphasis on basics. 

Smith, Edelsky, Draper, Rottenbcrg, and Cheriand (1990) conducted an 18- 
month observational study of testing effects in two schools. Their key findings, 
summarized in Rottenberg and Smith (1990), were as foUows. (1) "Brternal testing 
reduces the time available for ordinary instruction." (p. 5) Smith fiLal. estunatcd that 
in elementary grades test-related activities including test preparation, internal 
testing, and recovery from testing as well as the test administration HSLSS. take over 
100 hours, equivalent to a fiiU three to four weeks of school. (2) "In high stakes 
environment?, schools neglect material that the external tests do not mclude. (p. 6) 
Specifically, t eachers spend little time on science, social studies, and wnting, 
concentrating instead on reading, word recognition, recognition of enors in spelling, 
-age, pun?tua«ion, and arithmetic operations. "External testing encouraees use 
instructional methods that resemble test 
lere test scores fell just short of a year's { 
__ily review program that required pupils 1 
grammar, usage, punctuation, and capitalization. 

Romberg, Zarinnia, and WiUiams (1989) reported survey results for a 
national sample of eighth-grade mathematics teachers. Teachers were asked about 
the uses made of state anddistrict test results and about the influence of tests on 
teaching. Teachers said that as a result of testing they had increased instruction m 
areas such as basic skills (30% of the 354 respondents)jpencil and paper 
computation (25%), topics emphasized on the test (24%), problem sohmg (23%), 
and direct instruction to the whole class (16%). Instructional activities that 
decreased in response to testing were extended project work (19%), use of 
calculators (16%), topics not emohasized on the test (14%), use of computers (13%), 
and cooperative learning (10%).' Romberg fiLftl. concluded that instructional 
changes promoted by standardized testing were ant' thetical to the kmds of 
instructional changes sought by the mathematics communing and represented m the 
NrTM r»rricu1um and ^luatj ffn Standards for School Mathematics f 1989^ 

The present study was pari of a larger research project concerned with both 
the effect of testing on instruction and on student learmng. In the larger study 
(Koretz, Linn, Dunbar, & Shepard, 1991) the effect of standardized testing on 
student achievement was examined by administering a variety of other measures, 
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both standardized tests and alternative tests, to see if students really knew what they 
appeared to know based on their publicly-reported high-stakes test scores. The study 
reported here addressed the effects of standardized testingon instruction. It was 
intended in particular to extend the work of Smith fiUl- (1990) and Romberg £Lal. 
(1989). What do elementary teachers in two high-stakes distnctt perceive to be the 
influences of testing on their teaching? Are hioi-stakes uses or pr^ure to raise test 
scores associated with instructional changes? what specific kinds m^mstructional 
activities do they change in response to tests? We were also interested m the kinds of 
uses made of test results, and in teachers* perceptions of the importance wven to test 
results? Finally, how much time is spent testing and in preparing for tests7 What 
kinds of test preparation strategies do teachers use? Are teachers aware of cheating 
or questionaole testing practices? 



METHODS 

Teachers in two hi^h-stakes school districts were surveyed by means of a 
questionnaire addressing test-preparation practices and the enects of testing on 
instruction. Sampling prccedures, response rates, and the survey instrument are 
described in the fellowmg sections. 

Sampling and response rates 

The high-stakes districts in this study were the same ones described in Koretz, 
Linn, Dunbar, and Shepard (1991). The; were selected according to the following 
criteria: (1) their standardized testing programs were described by their Directors of 
Testing and Research as having veiv nigh stakes; (2) they represented different 
geographic regions of the country; f3) they admimstered veiy popular but different 
standardized tests; (4) they had sumciently large minoriw populations; and (S) the 
Directors of Testing and Research and district personnel were willing to participate. 
For purposes of the Koretz etal . study it was also essential that only Targe districts be 
selected to allow for different random samples of classrooms to take different 
alternative tests. District A is located in the southeast and has an enrollment greater 
then 60,000 with approximately 30 percent minority students. District B is located in 
the southwest and nas an enrollment greater than 50,000. About three-quarters of 
District B's schools have minority enrollments above 70 percent. 

Within the two high-stakes districts the intended sample of teachers consisted 
of two groups. First, we wished to include all the third-, fifth-, and sixth-grade 
teachers in the random samclcs of schools selected for the pupil testing part of the 
project. In addition, becpusv we feared that teachers* responses might be biased by 
the effect of giving an ada tional test as part of the research study, we identified a 
second random sample of schools and targeted the third-, fifth-, and sixth-grade 
teachers in those schools as well. 

Because neither district had directories of teachers names available, 
questionnaires were distributed through school principals. Individual stamped 
envelopes were attached to each questionnaire to enable returns directly to the 
researchers. Principals were also asked to return a form with exact counts of the 
third-, fifth-, and sixth-grade teachers in each school. 

Because of the constraints of the student testing portion of the research, tlie 
distribution of the teacher questionnaires was poorly Xupid. To avoid illegitimate 
practice on parallel forms of the districts' regular standardized tests, the parallel 
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forms test could not be administered until after the districts' normal standardized 
testing period in April. It was also decided that the teacher questionnaire should not 
be administered until after all standardized testing had been completed for the year. 
As a result the teacher questionnaires were distributed the third week m May, 1990 
with only two weeks remaining until the end of the school year. This timing 
undoubted^ reduced return rates. 

To calculate response rates we estimated the numbers of teachers in the 
intended samples from the counts sent to us by responsive principab plus the 
estimated numbcis of teachers in schools where the principal did not respond but at 
least one teacher did. We did not include in the estimate counu for those schools 
where we never heard from the principal or the teachers, because we beueved m 
these cases that the questionnaires were never distributed to teachen. In District A, 
80 teachers responded for a response rate of 37%. In District B, the response rate 
was helped by a follow-up letter sent to principals in August just as teachers were 
returning to school; 280 teachers responded from District B, or 44% of the intended 
sample. Altogether 360 teachers responded, representing approximately 100 
schools, for a combined response rate of 42%. 

Teacher questionnaire 

A four-page quesionn ^ire was developed to include questions in the 
following categones: Pressure io improve test scores, instructional effects, 
preparation for tests, controversial testing practices, uses of test data, positive and 
negative effects of standardized testing, and background information on teachers and 
schools.* Specific questionnaire items are displayed in the results section of the 
paper. We also provided teachers with two open-ended questions asking them to 
report on specific examples of positive or negative influences of standardized tests on 
their teaching or on students in their classroom. 



RESULTS 

Factor analysis 

To simplify reporting of results we wished to consider groupings of similar 
items. Factor analysis was used to check on the validity or meaninghiTness of 
intended subscales. Principle axis factoring with iterated communalities was applied; 
because the factors were expected to be correlated an oblique rotation was specified. 
Using a criterion of an eigenvalue greater than one, it was possible to extract 20 
factors. However, examination of the scree plot suggested either 11 or nine factors. 
We eventually settled on a nine-factor solution because this produced more 
interpretable results without items in the last factors loading on muhiple factors. The 
results of the factor anah^sis are reported in Table 1. 



*In devising questions on instr uctioral changes we drew specific examples from the 
study by Romberg et al . of secondary mathematics and developed our own exemplars 
for reading and for elementary grades mathematics. We are also grateful to Mary 
Lee Smith for suggesting test preparation and instructional effects items basr^i on the 
Smith et al. study. We thank Evelyn Brzezinski and Michael Hiscox of Inte wsst 
Applied Research for their advice and help in designing the final format of the 
questionnaire. 
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Whereas in advance we had anticipated that there might be one general 
factor explaining attitudes toward tests, pressure for accountability , ana instructional 
effects, instead we found that there were discrete fsUors corresponding to each of 
the subparu of the questionnaire. In addition three of the questionnaire subparts 
were each further subdivided into two distinct but correlated factors. Factor 1 
included all of the items about controversial testing practices; these itenis were used 
then to create the Controversial Testing Practices subscale (Controv). The specific 
items comprising this scale are listed in Table 6. Factor 2 and Factor 7 wete 
composed of subsets of items from the Testing and Instruction part of the instrument 
(see Tables 3 A 4). We had expected that when teachers emphasized basic skills 
instruction, vocabulary lists and the like, they would necessarily jghre less attention to 
higher-order skills ana acthdties such as extended project work m mathen atics. 
Instead, items in Factor 2 which we labeled Skills Instruction (Skillins) and those in 
Factor 7 which we called Divergent Instructional Practices (Divergen) formed two 
distinct factors. Interestindy these two factors were slightly positively correlated (r = 
.2) rather than inversely related. 

Factor 3 corresponded exactly to the Test Preparation (Testprep) subscale 
(see Table S). Factors 4 and 9 were subdivisions of the Test Use questions (see 
Tables 7 & 8). Because one set of items referred to uses of tests such as newspaper 
rankings, comparing districts and comparing srhools we called it the External Uses of 
Tests scale (ExtuseT. The other set, consisting of items such as evaluating teachers, 
evaluating principals, and allocating resources, was labelled Internal Uses of Tests 
(Intuse). 

The positive and negative statements devised to assess the effects of 
standardized tests were sorted precisely into two separate factors. Factors 5 and 6. 
The Pro Standardized Testing scale (Pro-tests) included items such as, "Standardized 
testing is helping schools improve," and "I'ests give me important feedback about 
how well I am teaching in each curricular area.^ The scale we identified as 
Measurement-Driven Instruction (Measdriv) was marked by items such as, "There 
are exciting new curriculum developments (e.g., whole language, minds-on-science) 
that I haven't implemented because they aren t compatible with the tests we're « 
measured by," and "I tend to drill students on basic skills because that is the only way 
I can be sure they will get what they really need to know." 

Five items about sources of pressure to raise test scores comprised Factor 8 
(Pressure). Onfy four items in the entire questionnaire did not fit with their intended 
subscale: writing essays, use of multiple choice exercises, timed computational 
exercises, and the use of test scores to promote or retain students. These items were 
omitted from the analysis of subscales and are reported on separately. 

Pressure to raise test scores 

Consistent with the selection of these districts as high-stakes sites, teachers 
reported that they are under substantial, even great pressure to raise test scores. As 
shown in Table 2, 53% said that they feel great pressure from the district 
administration or board of education to raise scores. Only 8% responded that the 
pressure from the district was slight or non-existent. The next strongest source of 
pressure was the newspaper or media. Most teachers did not, however, report 
feeling pressure from parents or from other teachers. The results were generally 
consistent across the two districts except that teachers in District B reported 
significantly greater pressure from other teachers. 
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Even within these districts where testing was thoudit to be pervasively high- 
stakes, variation in perceived pressure was associated with other effects. For 
example, as shown later in Table 13. the Pressure scale was wffelatcd with tca^^^ 
reports of measurement^iriven instructional practices (r « .30), with internal (to the 
district) uses of test results (r - .29), and with the extent of test preparatio- practices 
(r - .24). 

Instroctioiuil tctivitief emphasized because of testing 

Data in Table 3 iUustrate the instructional shifts that have occurred because 
of testing. Two-thirds to three-quarters of all teachers dve more emphasis to basic 
skills instruction, vocabulary lists, word recognition skills, and paper and pencU 
computation than they would if there were no mandated tests. It should be noted 
that Uie pervashreness of test-dL-ected instructional changes in *e$c high-stakes 
districts b far greater than those reported by Romberg iajlv(l?89) where only 30% 
of teachers reported greater emphasis on basic skills and 25% had increased their 
emphasis on pencil and paper computation. Our sample differed troin the Romberg 
study in several respects: (I) It focused on districts where testiiig was Imown to be 
hitth-stakes. while Romberg's nationally representatwe sample mcluded a vanew of 
high- and low-stakes settings including 13% in which students took no mandated test 
in mathematics. (2) Our study addressed testing in the elementary grades which we 
expect to create higher stakes for individual teachers than testing in secondary 
schools. (3) It is also conceivable that time is a relevant vanable, as suggested by the 
Director of TesUns and Research in our largest district who has seen an mcreasingly 
frenetic attenticn to test results despite there ha^dng been no formal changes in the 
district or state testing programs. 

The inclusion of the "reading for understanding" item in the skills instruction 
subscale and the finding that teachers reported increasing this activity in response to 
testing may seem anomalouii if one thinks of this as a high-order, thmkmg activity. 
However, we liken this finding to similar results in the Romberg fiLal- survey where 
83% of mathematics teachers said that standardized tests measure problem solving 
and 30% said they increased their emphasis on problem solving because of mandated 
testing. Romberg fiUl- (p- 84) pointed out that teachers appear to consider simple 
word problems as instances ot problem solving rather than adheringto the more 
ambitious conceptions of problem solving recommended by the NCTM Standargg- 
Although we do not have evidence here of the kinds of activities teachers are using 
when they emphasize "reading for understandine," it is plausible that they mean 
activities like those cited by Popham fiLal- (1985), i.e., findine the main idea and 
answering questions about passage details m parallel to reading comprehension test 
questions. 

Data in Table 4 show the effects of testing on instructional practices that 
might have been expected to be slighted if teachers give more emphasis to basic 
skius. Only the first question, however, clearly followed the predicted pattern.^ Half 
of the teachers reported giving less emphasis to "subjects which are not tested 
because of standardized testing. For most of the remaining questions the modal 
response was to say that these various acthdties were not influenced by standardized 
testing. Substantial numbers of teachers, however, reported responding to testing by 
increfiing activities such as "kids talking about what's been read" (41%), "extended 
project work in mathematics" (24%), "reading in books about social studies and 
science" (42%), "sustained silent reading" (40%) and Vork with mampulatiyes in 
mathematics" (47%). A clear majority of teachers said that they increased cntical 
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(hdiirins activities" and "practice in divergent problem solving," 63% and 57% 
respectively. 

When the results in Tables 3 and 4 are taken toffether it is clear th&x the great 
majority of teachers give greater attention to basic sldlls because of standardized 
testing. Some teachers follow the predictable pattern of increasing practice on the 
basics at the expense of more d^ergent instructional ei^riences. However, a 
greater number of teachers appear to increase all activities, especially those 
pertaining to reading and math instruction. The apparent contradiction, for 
example, ly;tween items 6 and 23, could be explained by saying that teachers give less 
emphasis ta subjects not tested (which includes social studies and science in these 
districts), but they have students do more reading in books about social studies and 
science, because this is a reading activity. This interpretation is consistent with the 
observation of Smith fiLfil* (1990) that with few exceptions, "science at the 
intermediate grades looks more uke reading all the time. Teachers feel they cannot 
afford to take the time required to set up science activities or do divergent problem- 
solving. Hence, they spend the time having the pupils read the text and answer the 
questions at the back of the chapter and take the unit tests (Rottenberg & Smith, 
1990, p. 7)." Thus without actual evidence of classroom practices, the claims that 
criticiall thinking and divergent problem solvins are increased should be interpreted 
cautiously, especially considering the kinds of instructional practices these same 
teachers describe in the narrative data in the final section of the paper. 

ITie separate character of the Skills Instruction and Divergent Instruction 
scales can be seen in their distinct patterns of association with other variables. As 
shown in Table 13, the Skills scale correlated the most stronely with Test Preparation 
(r « .34) and with Measurement-Driven Instruction (r « .27), whereas the 
correlation of the Divergent scale with these variables was .09 and -.18, respectiv ' * 
Both Skills and Divergent were correlated with the Pro-tests scale (r « .20, and .28). 
The Divergent Instruction scale correlated negatively (r « -.40) with Controversial 
Testing Practices and negatively with the number of instructional days spent testing 
(r = -.27). 

Test preparation 

In the preceding section we considered how teachers saw the character of 
normal instruction chancing in response to testing. Test preparation activities also 
significantly change the instruction children receive in a school vear to the extent that 
test preparation supplants normal instruction. Half of the teachers in these high- 
stakes districts spend four or more weeks giving students worksheets to review 
content they expect to be on the test and giving students practice with the kinds of 
item formats that are on the test. The majori^ of teachers also reported spending 
two weeks or more giving students commercially produced test preparation 
materials, giving practice tests, and instructing students on test-takmg strategies. The 
only test preparation acthdty that teachers did not report using extensively was giving 
old standardized tesu for practice. The most telling finding, concerning the influence 
of test preparation on instruction, was that 68% of the teachers reported conducting 
these test preparation activities "regularly," that is "throughout the school year," 
rather than limiting them to a few days or weeks before testing. 

It should be noted that this was the only question set where there were 
significant differences between the two districts, with District B reporting more 
extensive test preparation activities than District A. The two distncts did not differ, 
however, in the pervasive effect of test preparation throughout the school year. 
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In addition to the previously mentioned correlation between test preparation 
and instructional emphasis on basic skills, test preparation was also coirelated with 
the Measyrement-Dnven Instruction scale (r - .26) and with the use of multiple- 
choice fonnats in instruction item (r - .26). Test preparation also correlated .42 with 
increased instructional emphasis on 'Switing essays" probably because m Distnct B 
teachers were preparing students to take a mandated writing test. 

Omtroversitl testing practicet 

The section of the teacher questionnaire entitled "Controversial Testing 
Practices" included a range of test administration practices from those tiiat are 
clearly cheating to those that might boost scores without being considered unethical. 
Becaiie of the sensitive nature of these questions teachers were asked to indicate to 
what extent they were aware of these practices occurring in their schools, rather than 
being asked to report on their own behavior. Approximately one-quarter of the 
sample circled the unknown response categoiy. 

:tices the majority of teachers sai 
thAi'r srhnols. For example, 4991 

sly or never happened; and 
"giving students more time tnan test aireciions cau lor iwe^ or never happened 
according to 58% of the teachers. Of course, the negative side to these same data 
suggest that these kinds of practices do occur to some limited degree. For example, 
23% of teachers reported that "providing hints on correct answers'; occurred 
occfisionally or frequently. "Rephrasing questions during test admingtration was 
thoufiiit to occur occasionally or frequently by 18% of tiie teachers. &^en the two 
practices which were rejected by the greatest numbers of teachers, "Changing 
mamect answers to correct ones on answer documents," and "Encouragmg students 
who would have trouble on the test to be absent," were said to happen occasionally 
or frequenUy by 6% and 8% of the teachers respectively. 

The most frequently reported controversial testing practices were: "giving 
practice on highly similar passages," "rephrasing questions during test 
administration," providing hints on correct answers," and "giving stu^J«?iU:i more time 
than test directions call for." 

The controversial testing practices scale had some of th»r .• troi>jest 
correlations with other scales. It correlated .42 with Internal Lte *of T^sts such as 
evaluating teachers or evaluating principals, and .24 with Extern; ? s.'^s of Tests such 
as ranking schools in the newspaper. Controversial testing pracli'/<^v also correlated 
.45 with the Measurement-Drwen Instruction scale and .37 with the use of test scores 
to make student retention and promotion decisions. 

Internal and external uses of test data 

Responses about the ways that test data are used in the two districts are 
summarized in Tables 7 and 8. Given tiiat many of tiie specific questions pertain to 
district-level practices that should be constant for all teachers within each distnct, the 
variation in results must reflect differences in perceptions as much as actual 
differences in practice. For two of the questions, in fact, the majority of teachers 
circled the question mark indicating that they did not know whether test scores were 
used "to allocate or withhold extra district funds" or "to decide how to allocate non- 
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monetary district resources." Therefore, teachers may not have accurate information 
about alYof the uses of test scores. 

Half of the teachers reported that tests are used occasionally or frequently "to 
compare or evaluate teachers;" 45% said that they are occasionally or frequently 
used to evaluate principals; and 53% said tests are used occasionally or frequently "to 
determine awards for school excellence." On the &st three items, evaluating 
teachers, evaluating principals, and allocatine district funds there were significant 
differences between the two districts, with District B reporting these uses more often. 

In general externa! u^es of test results were reported to be much more 
frequent Tests are used to compare districts, to rank schools in the newspaper, and 
to compare schools; 76%, 76%, and 71% of teachers said these uses occurred 
frequently. Teachers reported the use of test scores slightly less frequently for 
decisions about curriculum and to evaluate innovative programs. 

Although constituting distinct factors, the Internal and External Use scales 
correlated with each other .51. As has already been noted these two scales in turn 
correlated with controversial testing practices and with the degree of pressure felt to 
raise test scores. 

Positive and negative effects of standardized testing 

Questions about the positive and negative effects of standardized testing 
comprised two scales. Fro Standardized Testing and Measurement-Driven 
Instruction, which are reported in Tables 9 andlO. Although the items on the Pro 
Standardized Testing scale are correlated, meaning that respondents who agreed 
with one item also tended to agree with the other items, there was also a number of 
respondents "in the middle" so that the majority agreeing or disagreeing switched 
sides from item to item. Overall teachers rejected more of the pro-tesung 
statements. The items on which a majority either disagreed or strongly disagreed 
(with corresponding percentages) were "standardized tests help to clarify important 
learm'ng goals" (65%), "standardized testing is helping schools improve" (64%), 
"without tests to enforce standards, students would be promoted without prerequisite 
skills" (71%), "the importance attached to test results gives teachers a sense of 
common purpose" (72%), "teachers who complain about testing are usually poorer 
teachers who do not want to be accountable" (77%), and "focusmg on tested material 
first ensures masteiy of i.he basics before going on to other matenal" (64%). 

Only two pro-standardized testing statements received endorsement from a 
clear majority of teachers: "standardized test results are helpful in identifying student 
strengths and weaknesses" (72% agreed or strongly agreed) and 'low test scores help 
get additional resources to students with the greatest Teandng needs" (53% agreed or 
strongly agreed). Teachers were equally divided on the remaining two statements: 
"tests ffvt me important feedback about how well I am teachmg in each curricular 
area," and "my scnoors emphasis on test results shows a real commitment to raising 
student achievement." 

The pattern of results on the Measurement-Driven Instruction scale reported 
in Table 10 was similar to that for the Pro-Standardized Testing scale. Individual 
items were strongly intercorrelated meaning that there were many teachers who 
either consistent^ endorsed or rejected most of the items; however, there we. e also a 
number of teachers in the middle who switched sides thus shifting the majority from 
item to item. 
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Four of the Measurement-Driven Instruction items were agreed to by a 
majority of teachers: "I spend more time teaching reading and math and less tmie 
teaching social studies and science because reading and math test scores are so 
importsmt" (64%), "I use fill-in-the-blank worksheets and matching exercises m my 
regular instruction so that my students will be comfortable with short answer formats 
when it comes time to test" (60%), "When I teach reading and math. I emphasize the 
skills and content I know are on the standardized tesu** (69%), and "A lot of the 
workbook and textbook activities I select for studente to do are very smiilar to me 
short passages and stand-alone questions that students will encounter on tests 
(62%). 

Test-driven items with which the majority of teachers disagreed were: 
"higher-order thinking skills are something I get tc only if there is time after covering 
the basics** (61% disagreed), and **I don't use essay tests during the year because I 
want my students to have practice with standardized test item formats" (75% 
disagreed). As noter! previously, many teachers in District B reported giving students 
practice writing essays oecause they face a mandated writing test. 

On the remaining four items, teachers were equally divided as to whether they 
agreed or disagreed: "My scnool is more interested in increasing test scores than in 
improving overaU student learning," "Gifted children get to do enrichment activities 
but at-risk children have to keep drilling on the basics," "There are exciting new 
curriculum developments that I haven't implemented because they aren t compatible 
with the tests," and"! tend to drill on basics because that is the only way I can be sure 
they will get what they really need to know." 

Recall that the pro-standardized testing scale was positively correlated with 
divergent instructional practices. In contrast, the measurement-driven instruction 
scale was negatively correlated with divergent instruction and positively correlated 
with pressure, skills instruction, test preparation, controversial testing practices, and 
both internal and external uses of test results. 

Odd items not included in the questionnaire subscales 

Three instructional influences items and one test use item did not correlate 
well with their intended subscales; data for each of these questions are reported 
separately in Table 11. Writing essays was expected to be a divergent instructional 
practice while use of multiple choice exercises was expected to go with basic skills 
instructional emphases. Instead both of these activities were reported to receive 
greater emphasis as a result of standardized testing, by 71% and 60% of teachers, 
respectwely, and both corelated most highly with the test preparation factor. Giving 
more emphasis to "timed computational exercises" was reported by 43% of the 
teachen. This item did not con elated with any scale in the questionnaire, except for 
a .18 correlation with test preparation. 

Forty percent of teachers reported that test scores were occasionally or 
frequently used to mike student promotion or retention decisions. Use of test results 
in Ais way positively conelated with test preparation practices (r « .19), with 
controversial testing practices (r « .37), w. h other internal uses of test results (r = 
.44), Riid with external uses of tests (r » .2t If^ factor loading was actual^ higher 
for the controversial testing practices factor ' the intended test uf e factor, 
hence the decision to leave it out of the scales. 
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Ttme spent fiving itandardized tests 

TWo final questions were asked about the number of hours spent giving 
standardized tests and the number of days interrupted by testing. Becai^ it had 
been anticipated that the student testing portion of the larger research project might 
actually affect teacher questionnaire responses regardins some aspects of testmg, the 
study was designed with teachers selected from two randomly equWalent samples of 
schools, one sample was invoKed in testing and the other not. Hccause there were 
significant differences between the two samples of teachers on the two questionnaire 
items pertaining to the amount of time spent testing, daU in Table 12 are reported 
onfy for the respondents in the nontesteo schools so that our own testing would not 
inflate the resulu. Note that responses fo? the tested and nontested research groups 
did not differ significantly on other questionnaire scalu. 

In District A 58% of the teachers reported spending from 4-8 hours per 
school year giving standardized tests, thus affecting 3-5 school days; 29% reported 
spending 9-16 hours testing, with 13% reporting 17 hours or mat given to testing. 
The time sivcn over to testing in District b is clearly greater with 52% sa)(ing that 9- 
16 hours ^e spent in actual test adnunistrations. IWenty-eight percent of the 
teachers in District B sp«!nd 17 hours or more testing. Not counting test j^reparation, 
teachers in District B Typically report spending two entire weeks simply ^ving 
standardized tests. 

Open-ended questions about the positive and negative effects of standardized testing 

Teachers were also asked to respond to u|M;n-ended questions about the 
effects of standardized tests. Tliey were asked i ~ give positive examples of how tests 
helped to improve the quality of education in their classroom or for particular 
children, ana they were asked to give examples of any negative influence of tests on 
their teaching or student leaminj. Table 14 provides a kngthy display of teachers' 
written responses from District B. A comparable analysis was done for District A. 
There were no apparent differences in types of responses between the two districts, 
except for the group of respondents in District B who said that the writing test had 
had a positive effect on their students' writing. 

Based on an initial reading of the data, it was decided that the data summary 
should reflect the overall position of each teacher as well as aggregate the n?<iny 
particular positive and negative exanoples. Therefore teachers were first sorted iiito 
four groups: Nonrespondents (N«47, 17%), Only Positive (N« 17» 6%), Both 
Posiv^e and Negative Answers (N»150, 54%), and Only Negative (N«66, 24%). 
Then the positive and negative answers wer^ separated and sorted mto 
subcategories. Data within each categoiy were reread aiid resorted by both authors 
until the cases were homogeneous within categories and distinguishable between 
categories. In creating the data table, examples were selected to include both the 
most typical statements as well as those that were the most different from the 
categoiy summary. When many teachers gave highly similar answers they were 
counted but not quoted in the table. 

Many teachers gave two or three positive and two or three negative examples. 
However, within the positive and negative sets only the first examplrt was used to 
classify a teacher. Therefore the counts for all the subcategories add up to the total 
L that categoiy. In all but two instances teachers' second and third answers 
duplicated what h:id been said by other teachers' fin»t answers; in these two cases 
teachers said that a negative effect of standardized testing wa^ to "slow down gifted 
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Students." We did not add in the duplicate counts for teachers second and third 
answers because it would make it difficult for the reader to make the numbers add 
up." However, it should be remembered that many of the high frequency answers 
both positive and negative would have had even greater endorsements if these counts 
had been added. 

GWen all of the national furor about standardized testing, one of the most 
notable aspects of these data are the many specific positWe examples teachers 
provided of beneficial uses of tesu. To be sure there were many negative answers 
oven, and quantitatively they outwei^ed the positive, but the majority of teachers 
gave considered, balanced answers. The balance shown argues for the integrity of 
the data and to a certain extent makes the negative statements all the more 
compelling because they did not appear to be reflexive, knee-jerk statements against 



tests. 



Six percent of the teachers in District B gave only positive answers and an 
additional 54% cited positive as well as negative effects. Many of the positwe 
answers are classic "textbook" examples ofhow test data should be used to diagnose 
gaps in student knowledge or to evaluate instructional weaknesses. For example, 
Tlie areas on which a large % score low (usage, etc.) - 1 evaluated and restructured 
my instruction to improve student understanding and mastery." Other positive 
answers vindicate reformers intentions when they mandated external tests. For 
example, "I think the tests encourage teachers to insure learning," and It helps focus 
some teachers on basic skills who may otherwise net focus on anything at all. 

Tne decision was made to provide an extensive data table rather than merely 
a summary of the category labels because this makes more of the data ac^ssible to 
the reader. A great deal of insiditful and meaningful data would be lost if only the 
summary statements were used. We tried to be faithful to the respondent s view of 
things in making our classifications. However, if we were to impose our own values 
on the examples given we might sometimes conclude that positive answers actually 
reflect negative practices. For example, "Cover more material in less time. Give me 
the incentive to drill on a daily basis-for a shfilL Shfid time-fundamental facts, 
"Stress on standardized items and repeated repetition helps the slow learners in the 
group," and so forth. There are also many examples as in the Romberg cLal. study 
where we do not share the same definition of terms as the respondents making it 
difficult to credit statements such as, "(Standardized testing) enables pupils to do 
critical thinking." Therefore the detailed information in Table 14 bears careful 
reading. Depending upon the point of view of the reader, it is possible to see there, 
even in the positive answers, the negatWe effects of standardized tests on instruction. 
Tests set the instructional eoals, provide intense motivation, and encourage teachers 
to address deficiencies skill by skill. 

One quarter of all the teachers in District B (2A%) gave only negative answers 
about standardized tests. They either left the question blank which asked for positive 
examples or said specifically that they "could not think of any." These teachers then 
have to be combined with the 54% who gave negative examples of test influences 
along with positive instances. The single largest category of negative response fi-om 
both groups was the complaint that standardized tests led to "too much teachmg to 
test content and test format." For example, "Critical thinking skills are basically non- 
existent in our chUdren because of drill and practice for (Test 1), (Test 2), and (Test 
3)," "I can*t get to science and social studies, like I would like to. Instead, I m 
preparing my class for a standardized test," 'Too much time is needed to emphasize 
test content, test taking skills, practice work-sheets," and "We are fionslanlly 
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reminded to practice, practice, for the test The fiin and excitement has been taken 
out of teaching." Counting all the responses in this subcategory alone (N « 66 + 28) 
accounts for 3A% of all the teachers who responded from District B, and this does 
not include teachers who gave answers of this type as their second or third respon'^e. 

Other small subcategon'es of negative re ponse add further to the picture of 
instruction distorted by testmg. These include: too much time given to testing and 
the boring effects of test pre{Muration. These two categories from both negative 
groups account for an additional 6% of the teacher sample. 

The other major categories of negative effects had to do with the stress of 
testing for both teachers and students and with concerns about the fairness or 
accuracy of test results. For example, Too much emphasis and pressure from the 
principaj, district and media. In my opinion, the pressure encourages cheating from a 
lot of teachers,"* "Students concentrate on Nvhat will be included on the test,' students 
are stressed out and get too nervous during the test." "Our standardized test scores 
are typed up and passed around to eveiy teacher to compare how they did to others. 
No mention goes to the teachers with low classes It is much too competitive.*' 



SUMMARY 

The purpose of the study was to assess the effects of standardized testing on 
instruction in two high-stakes school districts. Third, fifth, and sixth grade teachers in 
approximately 100 schools were surveyed with a questionnaire. The study was 
intended specifically to examine testing and instruction practices in settings where 
testing is high stakes. Findings clearly cannot be generalized to other contexts. 

A limitation of the study was the 42% response rate. The 360 teachers who 
responded are not necessarily representative of al! teachers even in the high-stakes 
districts. For respondents, however, there were a number of indicators that teachers 
answered carefully and took the task of filling out the questionnaire seriously. Thus 
we judged the quality or inteerity of the data to be good. Only 20% did not complete 
the open-endea questions. On several scales the majority opinion shifted sides often 
suggesting that respondents did not simply go down the page answerins strongly 
agree or strongly oisagree to all items. There were few questions left blank except 
for those questions where we had anticipated that teachers might not know the 
answer anc) had provided a question mark categoiy. The factor analysis showed 
highly internally consistent scales and nuanced distinctions between scales, which 
would not have been possible with careless responses. Answers to factual questions 
such as the amount of time spent testing were consistent with what we were told by 
the Directors of Research and Testing m each district. 

The major findings of the study were as follows: 

1. Teachers reported that they feel pressured to improve test scores by the 
district administratiOD and by the media. 79% of teachers said that they feel 
"substantial" or "great" pressure from the district administration to raise test scores; 
66% said they felt such pressure from the newspapers and media. Within these high- 
stakes districts, perceh^ed pressure to raise test scores was correlated with 
measurement-driven instructional practices, the use of tests to evaluate teachers and 
principals, and the extent of test preparation practices. 
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2. Because of the importMce of itandardized teits, tetchers five greater 
emphasis to basic skills instructioo. Two>thirds to three-quarters of teachers 
reported ovintt more emphasis to basic skills instruction, vocabulary hsts. word 
recognition skflls, and paper and pcncU computation than they would if there were 
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tests." in the narrative data from the open-ended questions, the single largest 
category of negatwe response was the complaint that standardized tests lead to too 
much teaching to test content and test format.** 

3. Non-tested content clearly JiufTers because of the focus on standardized 
tests. Half of the teachers said that they give less emphasis to "subjects which are not 
tested." In the open-ended questions, science and social studies were exphcitljr 
identified as the subjects that are sUghted. Many teachers also said that they give less 
attention to higher-order thinking cjid extended projects because of testing. 
However, a different group of teachers said that they increased these activities 
(especially in reading and math) in response to testing. 

4. In addition to the effect of testing on the character of normal instruction, 
testing also distorts instruction because of the extensive time given to test 
preparation. Half of the teachers in these high-stakes districts spend four or more 
weeks per year giving students worksheets to review content they expect to be on the 
test and giving students practice with the kinds of item formats that are on the test. 
A majonty of teachers also reported spending two weeks or more giving students 
commercially produced test preparation materials, giving practice tests, and 
instructing students on test-taking strategies. The most telling finding, concerning the 
influence of test preparation on instruction, was that 68% of the teacners reported 
conducting these test preparation activities "regularly," that is *'throughout the school 
year,** rather than limiting them to a few days or weeks before testing. 

5. Four weeks of test preparation does not count the one or two tuW weeks of 
school spent giving tests. In District A the modal response for number of school days 
interrupted by standardized testing was 3-5 days. In District B it was 6-10 days, with 
29% otDistrict B's teachers saying that they spent 11 or more days giving tests. 

6. Teachers reported that flagrant instances of cheating happen veiy rarely in 
their schools but other controversial testing practices that would clearly boost scores 
happen more frequently. For example, the two most extreme practices, changine 
answer documents or encouraging low scorers to be absent, were reported by only 
6% and 8% of teachers respectively. However, other practices such as "rephrasing 
questions during test administration," "providing hints on correct answers, and 
'•givinff students more tims than test directions call for," were thought to happen 
(Scasionally or frequently by 18%, 23%, and 20% of the teachers respectively. 

The controversial testing practices scale correlated .42 with internal uses of 
tests such as evaluating teachers or principals and .45 with the measurement-driven 
instruction scale. 

7. Teachers reported extensive use of test results for external purposes such 
as comparing districts, ranking schools in the newspaper, and comparing schools. 
They reported only slightly less use of tests for internal purposes such as evaluating 
principals and evaluating teachers. The more tests were used for these purposes, 
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the higher was the report of controversial testing practices and measurement-driven 
instruction. 

8. Id open-ended queitiont, the mi^ori^ of tenchert could identilV numeroTT« 
benenda] uiei oTitandardiaed tests, such as clearty setting instructional goals, 
providing feedback abont student strengths and weaknesses, and identuying nps in 
instruction. For all but a small fractkm of teachers, boMr^^r, these benefits from 
sUndardlzed testing were offset or greatly outweighed by negative effects, such as the 
amount of instructional time given to test preparation, the amount of stress 
experienced, nnlkir or invalid comparisons, and the demoraliztng effects on teachers 
and students. One-quarter of the teacher sample reported only negative influences 
of standardized tests on teaching and learning. 
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•.04 


-.04 


.02 


.03 


•19 


•.83 


•08 


• 13 


•04 


EPFECT65 


.06 


.06 


.08 


.01 


-.02 


•.82 


••17 


••02 


•06 


bFFECT63 


.25 


•.07 


.00 


.00 


-•23 


•.46 


•01 


•15 


••03 


EPFECT66 


.14 


.07 


•.IB 


.11 


•07 


••48 


••01 


•09 


•04 


EFFECTW 


•.04 


.04 


•.12 


•.04 


•00 


•.46 


.00 


••13 


••03 


EFFECT60 


.10 


.15 


•.04 


•.01 


.06 


•.48 


-•01 


• 14 


•02 


EFFECT69 


.07 


.04 


.12 


m A 

.10 


.02 


•.48 


•.12 


•08 


••05 


EFFECT73 


.00 


•.02 


•.21 


A^ 

•.04 


m A 

.19 


••41 


A A 

•04 


A A 

•.02 


A A 

••06 


INST26 


•.07 


•.05 


.00 


Aa 

.01 


At 

.03 


Aa 

•.08 


A A 

• 65 


« A 

-•10 


AA 

••03 


INST28 


•.01 


.21 


-.08 


AA 

•.09 


A A 

.02 


.08 


•88 


A 4 

••07 


A A 

•06 


INST1S 


-.07 


•.10 


•.Oo 


AA 


A<> 

.03 


A A 

.08 


m A 

•89 


A A 

.00 


AA 

•02 


iNST21 


•.15 


•Id 


AA 

*oo 


^ a A 
•.18 


A^ 

•.04 


A^ 

.04 


m A 

•86 


4 A 

• 12 


A9 


INST23 


•.03 


.09 


..08 


.00 


A A 

.00 


•.02 


• 89 


•.12 


••03 


iNST27 


.02 


.29 


-•09 


AA 

•.08 


AA 

..02 


A A 

.06 


•81 


-.08 


•06 


INST25 


•.20 


21 


AA 

..06 


A^ 

•.04 


AA 

.08 


.03 


A A 

•80 


A A 

.06 


••10 


INST22 


•.14 


.21 


•.02 


-.06 


.12 


•08 


•47 


.02 


••13 


INST18 


«05 


•.11 


•05 


.14 


.i1 


Aa 

.09 


•46 


•.02 


•13 


INST24 


•.08 


.19 


.07 


.02 


.07 


•.05 


• 41 


•.02 


••09 


W8T17 


.05 


••17 


•09 


.01 


.04 


.05 


• 40 


•.09 


•08 


MST14 


•«19 


.92 


.07 


•.04 


•.03 


•00 


•40 


.08 


••09 




•.06 


•.19 


.10 


.01 


.09 


.16 


.30 


•.04 


•01 


INSTie 


.03 


.03 


..13 


.08 


..02 


•.10 


.19 


.03 


•08 


PRESS3 


.02 


•.05 


•.14 


.10 


-.09 


•.03 


•.06 


.61 


•03 


PRESS5 


•.02 


.04 


..14 


.15 


-.03 


.14 


-.13 


•60 


-.04 


PRESS2 


•.02 


.09 


..08 


-.13 


.07 


-.10 


-.06 


• 59 


-.07 


PRESS1 


.16 


.03 


A^ 

-.05 


..08 


A4 

-.07 


-.22 


.02 


.82 


..02 


rnB554 




f% a 

•.01 


Aft 






Aft 

.00 


AC 
• 09 


.47 


AA 


USE55 


.00 


-.06 


..02 


.16 


.01 


-.03 


-.05 


.05 


•.84 


USE56 


.00 


..02 


..02 


.25 


.02 


.04 


••10 


-.01 


•.81 


USE48 


.25 


-.09 


.00 


.14 


-.05 


•.14 


.00 


.20 


-.42 


USE57 


.07 


-.05 


.15 


.25 


-.03 


.05 


.05 


.16 


-.39 


USE49 


•27 


••09 


..07 


.24 


-.11 


-.11 


-.02 


• 13 


>.33 



13 



Table 2 

ITEM MEANS AND FREQUENCIES AND SUBTEST STATISTICS 
FOR TWO HIGH-STAKES DISTRICTS ON THE SUBSCALE; PRESSURE 



PRESSURE FOR IMPROVED TEST SCORES. To what extent do you fed pressure from the following groups 
improve your students' standardlred test scores? 



No 
ProMuro 


Slight 
PrtMuro 


Modtrato 
PrtMuro 


Subitanttal 
ProMuro 


OrMt 
PrMturo 


P/iM • dioefc In tfi« •ppnprlti* column to ihow how much prtuur* 
«A imprAvm t»ai aiMrai you p§r$on%ll)f tool from oicfi of tfio group*. 












1. Myprindpal 












2. Other teachers 












3. District administration or board of education 












4. Parents 












5. Newspaper/media 





Frequencies in % 






Means 


no 
press 
(0) 


slight 
press 
(1) 


mod. 
press 
(2) 


subsL 
press 
(3) 


great 
press 
(4) 


blank 


! 


District 
A 


District 
B 


5.5 


11.1 


25.2 


26.9 


29.4 


1.9 




3.50 


3.69 


26.3 


24.7 


20.8 


19.7 


4.7 


3.9 


1.93 


2.67 


2.2 


53 


10.8 


26.3 


52.6 


2.5 




4.10 


4.29 


37.1 


20.5 


21.9 


11.6 


55 


3.3 


2.04 


2.32 


9.1 


7.5 


15.5 


19.1 


46.8 


1.9 




3.48 


4.01 










Subscale Total Score: 
Average Item Score: 
N- 


Combined 


District 
A 


District 
B 


16.42 (sd-4.22) 


15.04(4.02) 


16.95(4.12) 


3.3 


3.0 


3.4 


350 


80 


262 



Table 3 

ITEM MEANS AND FREQUENCIES AND SUBTEST STATISTICS FOR 
TWO HIGH-STAKES DISTRICTS ON THE SUBSCALE: SKILLS INSTRUCTION (SKILLINS) 



TESTING AND INSTRUCTION. Itertis 6-28 describe a variety o( Instructional activities. Circle the appropriate 
response to show whether the use of standardized tests In your schobi causes you to place more emphasis or 
LESS emphasis on the activity than If there were no mandated standardized iesxs. or whether your emphasis on 
the activity is not influenced by standardized testing. 



9. Basic sklUs in maih«matles 

10. Basic skills m reading 

1 1 . Reading for understanding . 

12. Vocabulary Ista- 

13. Word rtcognWon sl^ills 

1 9. Paper and pencil computation 

20. Direct instruction to the whole class 



ItMtCnytm^ t^^^^x^ Im-fiimlmtlt 

■ Itow C w yft i^ Nctlr^utmta^ t a m bPfi tmH i 
MMClfVfW* NoiMkMnoK/ UM&ipftuft 



Frequencies in% 



Means 





moK 


not 


less 


blank 


# 


emphasis 

(3) 


influenced 
<2) 


emphasis 

a) 




9 


74.0 


24.9 


0.0 


1.1 


10 


75.3 


235 


0.6 


0.6 


11 


72.9 


25.8 


0.8 


0.6 


12 


66.2 


31.3 


2.2 


03 


13 


69.0 


27.7 


2.8 


0.6 


19 


65.7 


29.9 


2.2 


12 


20 


5\S 


42.1 


3.0 


1.4 



District 
A 


DisUict 
B 


2.75 


2.75 


2.70 


2.77 


2.64 


2.75 


2.46 


2.69 


2.58 


2.69 


254 


2.68 


239 


235 



Sttbscale Total Score: 
Average Item Scor«: 
N- 


Gombined 


District 
A 


Disbrict 
B 


18.69 (sd«2.46) 


18.05(2.57) 


18.93(2.37) 


2.7 


2.6 


2.7 


354 


77 


269 



ERIC 
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Table 4 

ITEM MEANS AND FREQUENCIES AND SUBTEST STATISTICS FOR TWO HIGH-STAKES 
DISTRICTS ON THE SUBSCALE: DIVERGENT INSTRUCTIONAL PRACTICES (DIVERGEN) 



6. Subjects which Ve not tested Uameirptmtk ttain^mneta UuC<npAu<i 

14. Wcte talking about wtwi'i been read iio-einp^Mi. iMM«ntf u-iin^ 

15. Extended project work mnialhemalica uomBmimiB imiAj.^ Lm emptmtk 



17. ActlvHIes Involving calculator* »tamiimtmi$ awmmm umn^vhui, 

18. Actlvltlee Involving computere ^umbmtm^ mMmt umbmtmtk 

21. Small group Instnictlon item tm ptrntu MwM«m« umei^tik 

22. Cooperative learning „ — ttcmtmpimm MarMMmMtf_-.^UM6>pMt 

23. Reading In books about social studies and science MMOnp^ im mumm imm impKuk 

24. Sustained ellent reading MMAmAM* — — iMmb>phuu 

25. Work with nwilpulatlves ki nwthemaiics *»«• cwpfiM*— — ^nam^utnm^ «np*M* 

26. Ubraiy projects and report writing nbmempfm»k mm i kM r oaa.. umempnui* 

27. Critical thinking activities fnp/-* mtnmKmi uu c/npAt«i 

28. Practice In divergent problem solving MMfinpAutt NMMMmd^ iMmbiptmtit 







Frequencies in 9 




# 


inon 
emphasis 
(3) 


not 
influenced 
(2) 


less 
emphasis 
(1) 


blank 


6 


10.2 


38.2 


49.6 


1.9 


14 


41.0 


47.1 


114 


0.6 


15 


24.1 


49.6 


24.9 


1.4 


17 


5.5 


59.0 


33.2 


2.2 


18 


13.9 


59.3 


24.7 


22 


21 


34.9 


51.0 


13.3 


0.8 


22 


39.3 


47.6 


12.2 


0.8 


23 


42.4 


41.8 


14.7 


1.1 


24 


39.9 


463 


13.0 


0.8 


25 


46.5 


40.2 


11.9 


1.4 


26 


27.1 


46J 


24.9 


1.1 


27 


62.9 


26.0 


10.0 


1.1 


2& 


573 


305 


11.1 


1.1 



Means 



District 
A 


District 
B 


1.76 


1.55 


2.29 


2.30 


2.10 


1.96 


1.79 


1.69 


2.06 


1.84 


2.23 


2.22 


2.24 


2.28 


2.28 


2.28 


2.20 


2.29 


239 


234 


2.15 


1.99 


2.49 


2J55 


2.46 


2.47 



p<.001 



Subscale Total Score: 

Average Item Score: 
N- 


Combined 


Dittrict 
A 


District 
B 


28.01 (sd»5.18) 


28.51(4.15) 


27.78(5.45) 


2.2 


2.2 


2.1 


344 


78 


258 



ERIC 



^ i. 



Table 5 

ITEM MEANS AND FREQUENCIES AND SUBTEST STATISTICS FOR 
TWO HIGH-STAKES DISTRICTS ON THE SUBSCALE: TEST PREPARATION (TESTPREP) 



TEST PREPARATION ACTIVITIES. Fof items 29-35, tfy to recall how much time over this eittlre school year you 
spent in your classroom on the following test preparation activities. Then mark one box f<3r each item. 

29. Giving students worksheets that review the content you expect to be on the test 

I ]Notimt [ ]Ad«yerlM« { ]2-6dty« ( ]2-9wMkt [ ] 4 or fn»« WMkt 

30. Giving students practice with the kinds of item formats that are on the test 

C ]Notim« f lAdayorlM* [ ]2-5d«yt [ ]2-aw««ks [ ] 4 or mori> wMkt 

31 . Giving students commercialiy produced test preparation materials 

( INotiiiM [ ]Adtyori«M [ ]2-5dcy« [ ]2-awo«k« { ] 4 or mor • ivMkt 

32. Giving students practice tests developed by school, district or state staff 

[ ]No«m« [ ]Ad«yerlMt [ )2-Sdayt [ ]2-awo«k« [ ] 4 or moro mtkt 

33. Giving students old standardized test forms for practice 

[ JNotimo [ ]AdtyorlM« ( ]2-5day« [ ]2-3«vo«k« [ 1 4 ormor«wM,\(« 

34. Instructing students on test-taking strategies 

C ]Notim« [ JAdayorlMi [ ]2*5dtys [ ]2-3wMk« [ ] 4 or more weskv 

35. When did most of the test preparation activities you conducted take place? 

I ] A few days b«for«th« testing [ ] Rtgultrly throuohout th« school year 

( 1 A few wMksbeforo the testing ( j Not •ppiieobic, we didn't do any test preparellon. 



# 
29 
30 
31 
32 
33 
34 



35 



Frequencies in % 



Means 



no 
time 
(0) 


a day 
Driest 

(1) 


2-5 
days 

(2) 


2-3 
weeks 

(3) 


4-1- 
weeks 
(4) 


7lank 


6.9 


3.6 


15.2 


21.9 


513 


0.8 


25 


5^ 


15.8 


24.4 


50.7 


0.8 


12.7 


6.6 


18.8 


25.5 


35.5 


0.8 


75 


9.4 


21.9 


213 


38.5 


1.4 


44.6 


5.0 


175 


13.9 


18.6 


0.6 


0.8 


13.0 


^2.4 


19.1 


43.8 


0.8 


dayi 

hmfnrm 
DCIOR 

(1) 


weeks 
bcfbtc 
(2) 


Mgtt 

(3 


larly 

1) 


not 
applic. 


blank 


5.8 


23J 


68.1 


0.3 


23 



Subscale Total Score: 
Average Item Score (items 29-34): 



CombiMd 



24.69 (sd«632) 



3.6 



350 



District 
A 


District 
B 


3.29 


4.31 


3.49 


4.35 


2.70 


3.92 


2.74 


4.04 


136 


2.85 


3.27 


4.11 



233 



District 
A 



16.25(5.86) 



2.7 



78 



2.68 



Distarict 
B 



p<.001 



26.37(5.40) 



4.4 



265 



9 ' 



Table 6 

ITEM MEANS AND FREQUENCIES AND SUBTEST STATISTICS FOR TWO HIGH-STAKES 
DISTRICTS ON THE SUBSCALE: CONTROVERSIAL TESTING FRACnCES (COI^TRO V) 



CONTROVERSIAL TESTING PRACTICES. Items 36-46 list testing practices some teachers use to Improve 
students' test scores. To w^at extent do you believe they are practiced by teachers In your school? Use this 5- 
point scale to tell us the extent to which each of the practices listed In Kems 38 to 46 occurs In your school; 
simply circle the code that matches the frequency o( each practice. 

N • NEVER happens In my school 0 « OCCASICNAaV happens In my school 

R « RARELY happens In my school F « FREQUENTLY hapf^wln my school 

? > NO IDEA how frequently She practice cxicurs 



36. 
37. 
38. 
39. 
40. 
41. 
4Z 
43. 
44. 
4S. 
46. 



Providing hints on correct answers 

Giving students more time than test directions cad for » 

Reading questions that students are supposed to read themsetv^ . 
Angering questions duiing testing time aboiM test content 



Not administertng the test to students who would have trcutjie on the test.. 
Encouraging students who would have Uouble on the test to be at)sent.... 
Practicing on Hems from the test Itself 



..N 


.n 


0 t 


: •? 




R 


0 f 


: ? 






0 1 


• ? 




R 


.0 1 


• ....? 




R 


0 1 


- ? 


..N.... 




C I 


- ? 


..N.... 




0 1 


= , , . ? 


..N.... 


R 


0 1 


F . ? 


..N..,. 


R 


0 1 


F ? 


..N.... 




0 1 


F ? 


..N... 


R 


0 1 


F ? 







Freouencies in % 
















(1> 


R 

(2) 


" O 
(3) 


¥ 

(4) 






District 
A 


uiscrict 
B 


36 


1S5 


20.8 


16.9 


5.8 


28.0 




1.91 


2.02 


37 


38.0 


19.7 


15.2 


4.4 


22.7 




1.71 


1.85 


38 


38.8 


22.2 


11.9 


2.2 


24.9 




1.77 


1.68 


39 


43.2 


20.5 


8.9 


2.8 


24.7 




1.70 


1J56 


•10 


58.4 


7.8 


5J5 


0.6 


27.7 




1.16 


1.31 


41 


36.3 


20.8 


16.1 


1.9 


24.9 




1.77 


1.79 


42 


50.7 


15.8 


75 


5.8 


20.2 




1.29 


1.68 


43 


60.1 


10.8 


55 


1.9 


2t.6 




1.10 


1.42 


44 


54.6 


12.5 


8.0 


3.3 


21.6 




1.46 


150 


45 


56.8 


11.6 


6.4 


1.9 


233 




1J3 


1.41 


46 


24.9 


15.8 


205 


19.7 


19.1 




2.14 


250 




































t 


District 


District 






Subscule Total Score: 


Combined 


A 


B 














17.17 (9d«6.94) 


1625(5.86) 


17.57(;.26) 






ATCiaKc Item Skore: 


1.6 


13 


1.6 












B 


207 







p<.001 



aNote: The question mirk frequencies include the small percentage of teachers who left the quesHon blank as well 
as those those who dided the ? to indicate that they could not answer. Only response categories 1-4 were used in 
the calculation of means, hence the dramatic reduction in sample size compared to other tables. 

2J 



ERIC 



* 



Table 7 

ITEM MEANS AND FREQUENCIES AND SUBTEST STATISTICS FOR TWO HIGH- 
STAKES DISTRICTS ON THE SUBSCALE: INTERNAL USES OF TESTS (INTUSE) 



USES OF TEST DATA. Items 47-5V Kst sof ne uses of standardized test scores. Use the same 5-poInt scale to 
Indicate how frequently standardized test scores are used In your district for the following purposes. 



ig O O , ,C ? 

48. To compar* Of »valual« ttacntfs, ^....^ - " " ^ 

49, To compar* or tvaluata principals. - — N fl ...0 «F ? 

55. 
56. 
57. 



To allocata Of wfthhoWixlrt district funds N R 0 F ? 

To decide how to allocate non-monetaiy district resources N R 0 F ? 

To determine awards for school excellence N R O F ? 



Frequencies in % 



« 


N 
(1) 


R 

(2) 


O 
(3) 


F 

(4) 


?« 


48 


14.1 


14.4 


22.4 


28.0 


21.1 


49 


12.5 


10.2 


16.1 


29.1 


32.1 


55 


10.8 


8.6 


10.0 


21.1 


49.6 


56 


8.3 


7.2 


11.6 


202 


52.7 


57 


8.3 


4.4 


15.8 


37.4 


34.1 



Means 






District 


District 


p<.001 


A 


B 




2.37 


2.93 


• 


2.26 


3.07 


• 


2.27 


2.98 


* 


2.49 


3.(y, 




339 


3.21 





Subscale Total Score: 
Average Item Score: 
N« 


Combined 


District 
A 


Dislr<ict 
B 


14.15 (sd-4 J5) 


12.28(4.24) 


14.89(4.36) 


2.8 


25 


3.0 


143 


29 


110 



^Note: Tne question mark frequencies include the small percenUge of teachers who left the question blank as 
well as those who drded the ? to indicate that tftey oould not answer. Only response categories 1-4 were used 
in the calculation of means, hence the dramatic reduction in sample size compared to other tables. 



?4 

ERIC 



Table 8 

ITEM MEANS AND FREQUENCIES AND SUBTEST STATISTICS FOR 
TWO HIGH-STAKES DISTRICTS ON THE SUBSCALE: EXTERNAL USES OF TESTS (EXTUSE) 



USES OF TEST DATA. Items 47-57 fist some uses o( standardized test scores. U-e the same S-poInt scale to 
Indicate how frequently standardized test scores are used In your dbtrict for the following purposes. 



50. 
51. 
62. 
53. 
&4. 



To decide on continuation of innovative programs. 



« 


N 
(1) 


R 

(2) 


C 
(3) 


F 

(4) 




50 


3.0 


3.S 


13.S 


71.2 




51 


1.9 


3.0 


12 7 


75.9 


6.4 


52 


1.4 


33 


16.6 


6.12 


15.5 


53 


3.6 


75 


19.9 


42.9 


26.0 


54 


2.8 


25 


10.0 


75.9 


8.9 











...N R.... 






0 f 


N „..R.„. 


0 f 


Means 


District 


Dishkt 


A 


B 


3.55 


3.70 


3.61 


3.77 


3.33 


3.71 


3.29 


3.40 


361 


3.78 



.? 

.7 
.? 
.? 



p<.001 





Combined 


DisHct 
A 


District 
B 


Subscale Total Scoxr. 


18.15 (sd«2.$l) 


17.73(S.76) 


18.27(2.80) 


Aireiage Item Score: 


3,6 


35 


3.7 


N« 


247 


44 


197 



well as thossi who dided the 7 to indicate that they oould not answer. Only response categpries 14 were used 
in ths calculation of ine>ns. hence the dramadc reduction in sunpVs size compared to other tsbles. 
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Table 9 

ITEM MEANS AND FREQUENCIES AND SUIBTEST STATISTICS FOR TWO HIGH^TAKES 
DISTTUCTS ON THE SUBSCALE: PRO STANDARDIZED TESTING (PRO-TES-i 2) 

THCETFeCTSOf STANOAROBEOTesma Today in th« Unk«d 6tM IhM ii irMriM dflM* SbM wtMUwr 
the AarKtwdiiad tMting InpoMd on aehoois (or MOOuntabNy pur^^ 
pottntialV posUvf ind rN(^i)M oRocti of Mindard^ 









Nt 


























































C2. tandvdMMinoli Filipino tehooiilr^^ 












C7. VMtfiMtiMlotrtforotMndir4i.iiud«ntt 
oKgraatwthouipfWfQuiiH tkitt 












n Tnli OMi mt bitportM ftidbidc tboui flow wil 1 vn McNnQ 
InMcficurrtoolirMt. 












71. ThiimpoiianoimehtdiotMirti^g^MiMchMa^^ 












7Z My8Clttort«mp»MlioftMritti9tfwiMafMlOQmmi^^ 

10 (■MnQ InNNni ■OnWWTWl. 












74. TMOhnwnoeotnpMntOeuiMUngMuMMypoofw 
Moehm wtn do not ««« to b« aoeouniabtt. 












75. Pocutingont«Min««Wfn«nftmm«i«yattMbuics 
bitoro going on 10 othtr nulwUL 







Free 


[uendesin? 


b 


# 


strongly 
agree 
(4) 


modeiat 
agree 
(3) 


moderat 
disagree 
(2) 


strongly 
disagree 

U) 


ID 

opinion^ 


58 


21.3 


60.4 


10.5 


5.5 


2.2 


59 


12.7 


39.9 


17.5 


21.9 


8.0 


61 


7.5 


19.7 


29.4 




8.3 


62 


4.4 


25.2 


26.0 


38.0 


6.4 


67 


4.7 


18.8 


24.1 


47.1 


5.3 


68 


9.1 


39.1 


23.0 


24.7 


4.2 


71 


4.2 


16.9 


22.7 


493 


6.9 


72 


11.9 


313 


22.7 


24.4 


9.7 


74 


3.9 


10.0 


205 


56*8 


8.9 


75 


6.1 


24.4 


24.7 


393 


5.5 



Means 



District 
A 


District 
B 


'<i.93 


3.01 


2.60 


2.44 


1.96 


2.00 


1.89 


1.98 


1.69 


1.83 


231 


2.35 


1.66 


1.76 


2.24 


2.37 


1.25 


1.67 


1.85 


2.00 



p<.001 



Subscale Total Score: 


Gombincd 


District 
A 


District 
B 


Average Item Score: 


20.75 (sd«534) 


19.84(4.78) 


21.01(5.73) 


2.1 


2.0 


2.1 


N- 


251 


51 


193 



^Hote: H^e question itiark frequencies include the small percentage of teachers who left the question blank as 
well as those who drcled the ? to indicate that they could rict answer. Oily response categories 1-4 were used 
in the calculation of means, hence the dramatic reduction in sample size compared to other tables. 
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Table 10 

ITEM MEANS AND FREQUENCIES AND SUBTEST STATISTICS FOR TWO HICH-^STAKES 
DISTOCre ON THE SUBSCALE: MEASUREMENT-DRIVEN INSTRUCTION (MEASDRI V) 



OjnQfti 



M. l i piiidfwofiiimttMCWngtf>d>ioydm^ 

. _ ^ Kfititffll 



Ml ioofit in to importinL 



A MytetiooliimwifitriMdtnjyffMi^ 
. improving 0¥m«udMtlNminOi 



A HV^-ofdirtMdMiltliMMmMnoloilto 



H. QiiidclMmnMlodotrMvnMaci^^ 
tiM 10 kMp dwig on mt Mies* 



7a l0on1oM«$ayit«durtnomiyourbiCfc»«lw«nv 
Mdra to hM prMk» Mndvdiztd tM km 



73. Alqtof«»wort(ftookandi«&ookocttyfci«lM^ 

10 do «• vtfv airT^ 10 M ihon poHogM and MAMona 
qMUont mil itudMi wl onoounMr on toM. 



cauMMy onl oofT^tM «M M Mtt 



77. litndtodrilstudoruonbftstoskKsboeauMtMtimoonV 
way t can tM sura mty wl g«i wn« tfwy ruiy noad 10 know. 





I 


'inequencies 


in% 








Means 




« 


strongly 
agree 
(4) 


modcrat 
agree 

(3) 


moderaL 
disagree 
(2) 


strongly 
disagree 
(1) 


m 

opinion^ 




DUtrict 
A 


District 
B 


60 


34.1 


29.9 


14.4 


175 


4.2 




2.68 


2.88 


63 


22.4 


235 


24.4 


24.7 


5.0 




2.24 


2.52 


64 


19.9 


39.9 


19.4 


15.0 


5.8 




2.81 


2.65 


65 


12.7 


24.4 


205 


40.2 


2.2 




2.30 


2.04 


66 


31.9 


36.8 


14.4 


14.1 


2.8 




251 


2.99 


69 


19.4 


28.0 


18.3 


28.0 


6.4 




250 


2.45 


70 


5.5 


9.7 


25.8 


49.0 


10.0 




1.66 


1.69 


73 


18.6 


433 


18.6 


14.4 


5.0 




258 


2.73 


76 


15.5 


26.6 


16.9 


28.0 


13.0 




2.11 


2.40 


77 


103 


363 


22.4 


24.4 


6.4 




2.31 


2.36 


































District 


District 






Subacale Total Score: 


Combined 


A 


B 












24.40 (sd«6.46) 


2324(6.38) 


24.93(6.40) 






Average Item Score: 


2.4 




25 


25 




i 






N- 


244 


ri 


187 



*Note: 'No opinion' responses are excluded from the calculation of means. 



p<.001 
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Table 11 

ITEM MEANS AND FREQUENCIES AND SUBTEST STATISTICS FOR TWO HIGH-STAKES 
DISTRICTS ON ODD ITEMS: WRITING ESSAYS, MULTIPLE CHOICE EXERCISES, 
COMPUTATION EXERQSES, RETENTION DEQSIONS 



7. Writing essays . 

6. Us« of muiUplschoic«.fia4n and maiching exercises 
16. Timed computational exercbes 

47. To promote or retain students. 



t»wiftnpfiii<i mmmwmt l— rrryftfi 

> N**««M R 0 F 7 



Frequend 


iesin% 


moie 
emphasis 

(3) 


not 
Influenoed 
(2) 


leu 
emphasis 

(1) 


blank 


71.2 


23.0 


5.5 


0.3 


60.4 


33.8 


3.9 


1.9 


43.2 


47.1 


8.6 


1.1 



Average Item Scoie.il 7 
N- 


Combined 


District 
A 


District 
B 


2.65 (sds058) 


2.04 


2.84 


369 


80 


280 


Average Item Scoredl 8 
N- 


Combined 


District 
A 


District 
B 


258 (sd=057) 


2.54(0.59) 


2.59(0.56) 


354 


80 


274 


Average Item Scoxehl 16 

N- 


Combined 


District 
A 


District 
B 


235 (sd>0.63) 


2.29 


2.37 


366 







Ppequencie8in% 





N 


R 


O 


F 


7« 




(1) 


(2) 


(3) 


(4) 


« 












47 


253 


22.7 


26.6 


13.9 


11.4 



p<.001 



Average Item Score: 
N« 


Combiitcd 


District 
A 


District 
B 


233 (9d«1.06) 


2.10(0.94) 


239(1.08) 


328 


71 


249 



^Note: The question mark frequencies include the small pen:entage of teachers who left the question blank as 
well as those who dided the ? to indicate that they could not answer. Only response categories 1-4 were used 
in the calculation of means, hence the dramatic reduction in sample size compared to other tables. 
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Table 12 

FREQUENCffiS FOR HOURS AND DAYS SPENT GIVING STANDARDIZED TESTS 



80 Estimate hov/ much time you have spent In your dassroom this school year gfcdng standardized tes 
. (ASistering the reading and math subtests of a typical standardized test takes about 4-5 hours.) 

I lUMthw4houw ( l^houw I ]»-iehour« ( ll7ormorthou« 

81 . How many days have been Intermpted this school year as a result of gh/lng standardized tests? 

I ]2orftM«r I 1« I IMO I lllofmof* 







<4houn 


4-8houn 


9-16 houn 


>17hottn 


blank 




Di;$trict A 


0.0 


583 


292 


125 


0.0 




District B 


.«.o 


16.2 


513 


28.3 


1.0 




N=:99 














Combined 


2.4 


24.4 


47.2 


25.2 


.8 




N=:123 













# 




2/fewer days 


3-5 days 


6-10 days 


11-f days 


blank 


81 


District A 

N=24 


0.0 


79.2 


16.7 


4.2 


0.0 


District 5 

N=:100 


3.0 


12.0 


56.0 


29.0 


0.0 


Combined 

N=:124 


2.4 


25.0 


48.4 


24.2 


0.0 



Table 13 

CORRELATIONS AMONG SUBSCALES AND ODD ITEMS 





rrafaura 


Ofiiiiina 


uivafQan 


1 Mipiap 


wuniiov 


iniUSv 




Prtssurft 


.73 














Skiiiifis 


.13 


.83 












DiV8fQ#fl 


-.18 


.26 


.84 










T«$tpr«p 


.24 


.34 


.09 


.87 








Controv 


.23 


.10 


-.40 


.06 


.94 






inlus« 


.29 


-.10 


-.24 


.03 


.42 


.84 




Exus« 


.22 


-.07 


-.12 


.00 


.24 


.61 


.84 


Prot0St 


-.16 


.20 


.28 


.12 


-.16 


-.08 


-.09 


Mtasdriv 


.30 


.27 


-.18 


.26 


.45 


.21 


.21 


Essays 


.07 


.23 


.14 


.42 


-.08 


.13 


-.02 


MuNip 


.09 


.23 


.00 


.26 


.09 


.02 


-.03 


Timaoomp 


.06 


.16 


.14 


.18 


.05 


.03 


.06 


Promratn 


.10 


.05 


-.10 


.19 


.37 


.44 


.26 


TMlhrs 


.14 


.03 


-.12 


.22 


.19 


.00 


-.04 


Intdays 


.20 


.00 


-.27 


.21 


.20 


.16 


.08 



Protasts Measdriv Essays MuHip Howoom Promretn T«&thrs intdays 



.81 












.10 


.82 










.14 


.04 


.87 








.01 


.10 


.06 


1.00 






.02 


.14 


.00 


.09 


1.00 




-.11 


.14 


.07 


.08 


.05 


1.00 


-.13 


.15 


.04 


.01 


-.05 


.12 


-.18 


.20 


.14 


-.02 


-.08 


.14 



NOTE: Internal consistency coefficients are reported on the diagonal. Ns for 
individual pairwise comparisons ranged from 102 (for the MEASDRIV by 
INTUSE correlation) to 346. 

Correlations significant at p < .01 are in bold face type. 
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Table 14 

Summaiy of Teachers* Answers to Open-ended Questions 
on the Positive and Negative Effects of Standardized Testing 

( District B» N - 280 ) 

Personal Experience with Standardized Tests. The debate about testing--pro and 
con-is often vague. Do you have specific examples of how standardized tests have 
had an impact on you or your students? 

78. Give one or two examples of how standardized tests helped you to improve the 
quality of education in your classroom or for particular children. 

79. Give one or two examples of how staiidardized tests had a negative influence on 
your teaching or student learning. 



Non Response: 17% (S^Al) did not respond to the open-ended questions. 



Only Positive: 6% (N=17) cave only positive answers, some said specifically that 
question 79 was not applicable. Categories of only positive responses are as follows: 

Tests set instructional goals and ensure coverage and pacing: (N =3) 
"Teachers combined resources and ideas which helped me prepare meanmgful 
teaching lessons." 

"Knowing the scope and extent of info to be tested allowed me the opportunity 
to insure that these areas were taught or familiar by test time." 

Identify weakness so as to refocus instruction (for the class as a whole): (N-3) 
"I use the test results as one indicator of personal performance. Any time my 
average scores are not in stanines 7,8,9-1 know I need to reevaluate and 
restructure my program." 

'Testing allows me to determine which areas of my curriculum need 
improvement." 

Identify strengths and weaknesses of individual students for extra help: (N=5) 
'Test results give me a basic idea of what my students are weakest in. Then we 
can really work on these areas." 

Improve achievement: (N='2) 

"(The state test) improved wnting skills for all my children." 
"Improve reading stamina and learning more vocabulary words." 

Motivate students: (N=2) 

"Tests give children goals. They know what they have learned is acknowledged." 
Other positive responses: (N=2) 

c.g., "They gave me the idea of teaching beyond my grade level thus making it 
easier for my students in later years...." 



Positive Responses: 54% of teachers (N«150) gave both positive and negative 
responses to questions 78 and 79 respectively. Categories of positive response are as 
follows: 

Tests set instructional goals tnd ensure coverage nnd pacing: (N* 18) 

They have given me a clearer notion of what it is I am supposed to teach." 
**Standardized tests are used as a guide-like sets of objectives.** 
"Cover more materisd in less time?' 

**Standardized test gives me standards for teaching and a goal to work toward. It 
motivates me to use different teaching styles to effectively reach all of my 
children.** 

*'I think the tests encourage teachers to insure learning." 

**Standardized tests facilitate greater consistency in the curriculum both within 

classrooms at a single grade level as well as across grade levels school wide." 

'*Directs our attention to particular skills we use in our reading, writing, thinking, 

and discussion of a whole (integrated language arts program). ' 

"more intense teaching.'* 

Ensure teaching of basic skills: (N> 12) 

"Standardized test gave a nice vanetv of the different basic skills that the 
children needed to master." 

"The tests help us focus on certain basic skills that otherwise we may be some 
what relaxed (about)." » 

"Give me the mcentive to drill on a daily basis-for a short, short time- 
fundamental facts.'* 

"Puts emphasis on basic skills to that slower students master them." 

"I probably put stronger emphasis on certain reading skills such as getting the 

main idea and reading for details in order to achieve a higher level of 

performance." 

''Stress on standardized items and repeated repetition helps the slow learners in 
the group." 

"It helps focus some teachers on basic skills who may otherwise not focus on 
anything at all." 

*'Because of these tests I am constantly reviewing and reinforcing skills taught all 
through the year." 

Identify weakness so as to refocus instruction (for the class as a whole): 
(N-30) 

"I use tne results to help me identify the areas that my students are weak in, and 
focus on those areas first before implementing new concepts." 
'The areas on which a large % score low (usage, etc.) I evaluated and 
restructured my instruction to improve student understanding and mastery." 
"I paid closer attention to specific objectives and goals. I used it as a guide to 
personal improvement in teaching." 

They are flQ£ resource for feedback on what I need to emphasize more in my 
lesson planning." 

**Some tests ask questions involving higher level thinking skills which require the 

teaching of those higher level thinking skills." 

"I found out students needed more teaching of fractions and number 

sequences." 

*'Hmphasizingskills where students were low-using different strategies to teach 
there skills. (District is getting better about offering training and encouraging 
the .se of new strategies.)" 

"It akrted me to focus on the skills the children were deficient in." 
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"After I have eiven pretests, I can analyze the results and concentrate on wltat 
areas are low. ... ^ . 

-Identifying weakness, specifically in comprehension, has caused us to 
emphasize critical thinking skills and aUowed greater flexibUity in exploring new 
tecnniques.** 

Identiiy stitngths and weakness of Individual students for extra help? (U^ ^9) 
'Saw weaknesses in pretest and individualized instruction or used small gic up 
instruction to help master basics." . ^ ^ 

''When the test was used as a diagnostic tool to determme strengths and 
wc&kncsscs«^ 

"In some cases such tests have revealed weaknesses in time to remediate; iuch 
wc&kncsscs*^ 

"It gave me a chance to know what to focus on if a child was weak in a particular 

aiea." . , 

"Standardized tests do provide a starting point for emphasizing students 
strengths and weaknesses. The tests also lead parents and te:acher/» to 
discussions of individual students that can be helpful." 
"In looking at test scores I am able to help student with the area that they are 
having the most trouble with." 

"It helps to see what categories they are have mastered and thosr, that are 
having trouble." ^ , . 

"It can highlight skills not mastered to be reviewed over the surflmer and oarin g 
the upcoming year." , /^v ^ u 

"The test showed (1) the areas where students need extra help (2) enabled the 
parents to share in upgrading their child's study habits." 
emphasized word recognition skills after my ESL students did poorly. 

Improve achievement: (N»6) 

"Improves basic skilU ." 

"^tate test) scores went up dramatically." 

"Reading skills and math skills were enhanced." 

Writing test improves writing skills: (N=7) 

"(State test) emphasis on writing results in constant writing efforts m .Ml subject 
areas and aevelopment of a diversity of writing 1 "Tnats." 
"It helped children improve and enjoy their writing." 

Improve test-taking skills: (N»8) 

'Test taking skills help those students who are afraid of test." 

"Children get the experience in taking tests. They need to be test conscious. 

Society leans towara testing for many jobs." 

Motivate students: (N»8) 

"CMdren were anxious to learn to read in order to be able to take the test. 

They were eager to practice problem sohdng skills." 

"Testing is a challenging follow up to instruction. Children love the 

competition." 

"A form of goal, or incentive, to learn and do their best. 
Improve student self-confidence: (N=5) 

"It raised self-esteem tremendously in a couple of students who passed the tests 
much to their surprise." 



"By taking the standardized tests the students felt like they were working toward 
a common goal. A hurdle to overcome. Getting high scores on the practice 
tests made them feel smarter.** 

Used in placement and grouping decisions: (N*2) 

'Standardized tests are part of the prereqisisite to get . ao the gifted class and 

advanced classes." 

Treparing my reading and math groups." 

Provide norm-referenced information: (N«2) 

"It helps to know where students rank among the nation." 

Other positive response;! (N« 13) 

e.g., ""We team taught for (the state test) and found it good for students and 
teacners." 

"It ent^bles pupils to do critical thinking." 

"They i:re sirnple sliaight to the point directions." 

"Affirm current classroom results." 



Negative Responses: 54% of teachers (N= 150) gave both positive and negative 
responses to questions 78 and 79 respectively. These are the same ISO teachers who 
gave positive answers above. Categories of negative responsr: »;e as follows: 

Too much teaching to test content and test format: (NB(36j 

'Teachers feel pressure to *teach to test items' and avoid higher level thinking 

skills." 

"I can*t get to science and social studies, like I would like to Instead, Tm 
preparing my class for a standardized test." 

'riffle I would like to have spent using the Whole Laneuage approach was used 
ur in practice tests and district practice materials. I felt prtjssure from my 
principal to complete all practice materials, because she feels that is the road to 
success*" 

"Critical thinking skills are basically non-existent in our children because of drill 

and practice for (Test 1), (Test 2), and (Test 3). 

"You don't get to do as many special projects or have open discussions." 

"I feel I could be more creative in my 1 saching if I didn't feel the necessity to be 

sure every child understood each basic skill." 

"I wasn't able to do enrichment activities because they weren't part of the 

objectives. Students became burnt out on basic skills." 

"don't go through the curriculum in orderly fashion because trying to cover 

things on ihe test first. Cover som^ unimportant things because they are on the 

test.'' 

"Teaching testing skills takes so much time that much other material cannot be 
taught." 

"The students receive little hands-on learning in place of drill and specific skills 
teaching." 

"Only test objectives are being taught. We need a well rounded curriculum." 
"At times it does not allow me to be as creative as I wish. Especially when it 
comes to the writing. The tests expect so much structure." 
'The timing of the tests in the year covering material not appropriate to be 
taught yet, yet needing to be mastered on the test." 

"I spend less time teaching science and social studies because reading and math 
games are so important." 
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Too much time needed to emphasize test content, test taking slrilk, practios 

^UT^CT^tvvt teaching- You are ahvayi teaching (or so it seems) Jowards th^i 

test. *Conjtant reminders on the PA system about test scores. 

Two weeks prior to testing an increase in practice on test format disrupts our 

usual unit plan, whole language, hands on approach to education. 

'^Constant drill, no higher cognit ve skill leammg for slow learners. 

"Our lesson plans have to show n ^hat test objectives we're workmg on for the 

week. I'm forced to teach for th s test" 

"We spend too much time at ow school on the basics and test takmg 
techmques." . ^ , . 

"Do not teach but basics on the test Do not teach enough to excel m upper 

?do?t feel I should drill for the test format all year long." 

"Had to move on too quicUy in order to cover all matenal required on test. 

Too much time, too much testing: (N"8) 

"I have a poor attitude because here in (State) we test too much. There are 
State tests and national tests. We don't need so much nor so often. It makes 
me, and my students hostile toward testing." 

"We spenr. too many days on standardized or district made test preparation 

matenals.' . . . . . 

"The time required to test detracted from time needed to teach- Vre used 10 

days fji fifth grade this year." 

Test preparation boring to studeocs: (N=4) 

"Students and I become bored with practicing test tabng skills. ^ 
"Kids start tuning out because they really dishke there tests (and that s with a 
positive approach from a teacher)." 

Stressful for students and teachers: (N»28j 
"Pressure to do well makes everyone uj2iight>" 

"Some of the children feared taking the tests, therefore they did not do as well 

as they were capable of doing." . 

"It tends to add a small amount of presiare on the students, especially when 

thev know how to do a certain proolem but can't remember." 

•Too structured and fkes'fun'out of learning." 

•The pressure on the st jdents especially the younger ones of actually taking the 
test itself. Some were vcn? nervous while others thought that the test was fun." 
Toci much emphasis is cut on student performance in negative ways such as 
newspaper reports. This causes student and teacher pressure and stress to 
overperform. 

Too much emphasis and pressure from the princlpl, district and media. In my 

opinion, the pressure encourages cheati ng from a lot of teachers." 

"Mainly that the students and teachers get too stressed out before, dunng, and 

after the test." , , 

'Students concentrate on *what will ftc included on the test, students are 

stressed out and get too nervous during the test." 

"Students worrying about passing or doing well. Wasting time tiymg to comfort 
and tering them to do their best? . 
'These test scores arc used to measure my effectiveness as a teacher in the 
classroom. The pressure from administration for high test scores and the media 
has a negative effect on teachers." 
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"Because the test results are 'flaunted* in the news etc., I become veiy resentful 
at having to give the tests. The pressure if bad on teachers and therefore passed 
cn to students. It*s as if school's out when testt are completed.** 
The students are put under an unimaginable amount of pressure while takine 
the test and when results come back, depending on where they are, their level of 
esteem is considerably dropped. 

Inaccurate measure of teachers, ftudents. or curriculnm: (N«24) 
''These tests may test objectives not included in the curriculum.** 
'*A student may nave good Imowledge of information but when it comes to 
testing, they can't demonstrate it as well as eveiyday classroom learning.** 
"I feefthat when scores are published other factors should be included, i.e., 
economics, and social variables.** 

'*(l)Comparison to others, negative efifer< on schools, districts, and students, (2) 
Year to year comparisons, i.e., growth, not including subjective factors like 
health, environment, etc..** 

The standardized test does not ahvays reflect the capabib'ty of the student. 

Some students test well but will perform low and vice versa." 

*These tests may test objectives not included in the curriculum.** 

''When a student scores high and is expected to function at that level, but the 

student can't actually perform at grade level." 

"Tests inappropriate SKills." 

"Children that don't test well are penalized." 

"A teacher in the previous grade teaches to the test and as a resuh when 3rd 

Sade gets its chilaren we see negative growth in many subjects each year, 
[though we know the e^lanation, this negative growth has shadowed our 
opinions of the jobs we've done...." 

Negative influence on slow learners: (Nb5) 
"labels children." 

"When students with excellent scores criticized students with lower scores, thus 
making them feel inferior." 

"Standardized tests have had a negative influence on student learning when they 
are used to display their performance in a negative way rather than in a positive 
manner." 

Culturally biased: (N«4) 

"Cultural backgrounds being tested and compared-unfair to disadvantaged 
students." 

"My bilingual students are sometimes not ready to take a standardized test but 
the district demands it Students often guess and get frustrated." 

Undermines teacher oonfidence: (N«4) 

"Teacher morale! We have been harassed unmercifully this year, and it is 
wearing us out" 

"When students still don't score as well c^en when you have been reteaching 
and going o^' er skills consistently all year." 

/ 

Other negative responses: (N«7) 

'Publication in the newspaper of various schools' scores." 

"The necative side of the issue is that the test results ore used for all kinds of 

reasons besides helping the child." 

"Some become frustrated during the actual test." 



Timed for mastciy; some students require more time than most tests allocate. 
The reading passages are biased in reference to some students environment. 

Only Negative: 24% (N«66) gave only negative answers, some said specifically that 
they could not think of positive answers to question 78. Categones of only negative 
responses arc as follows: 

'nio much teaching to test content and test format (N«28) ^ 

"Repetitive, borisig drill. Use of district materials required too much tmie. 

"Standardized test has had a negative influence on my teaching because some 

enrichment as well as practical skills must be set aside.** , 

"I feel very sorry for the rhildren in (State) whose school admmistrators and 

media cause them to mis'i out cn so many areas of learning. Many teachers ^ 

focus cnlx on what tests cover, and I believe the children are suffering from it. 

"I teach test Uking skills because it's required." . 

"I was given an abundance of testing practice material each month and 1 was 

told I must do it.'' ^ ui i • « 

'Too much time spent on test-not enough on thinkmg/problen solving. 

"I had to spend a lot of time going over basic reading skills in all subject areas 

and basic skills in math and ffelt i couldn't do a lot of fun math and science 

activities until the test was over." r . 

"We are constantly reminded ♦o practice, practice, for th»i test. The tun and 

excitement has been taken o-jt of teaching." ^ , 

"Many types of discovery ieaming cannot be assess* d. Therefore not as much 

as shouufbe is spent on this type of learning, e.g., science projects, group 

experiments." 

"I always felt things were being rushed so much. Test objectives are becoming 
the subject matter and the activity of the day." 

Too much time, too much testing: (N=6) , ^. . 

"I field-tested next year's (State test); students took one pretest for the district to 
prepare for (State test), the State test, and finally the (nationally normed test). 
Four weeks of testing IS too much." 

"Students became bored with the repetitive practice tests. Valuable class time 
was lost to practice test." 

Stressful for students and teachers: (N=13) , 

"Highty stressful. I don't feel the results arw accurate. Ovcrwhehmng demands 

ancfexpectations on teachers." 

"Everyone gets worked up about the test and misses the joy of learmng. The 

test makers gear the test toward the WAS.P.S and exclude most other racial 

minorities." . ^ . 

'Too much pressure on the students and the teachers. There is not enough time 

to do extra supplemental activities and enrichment work." 

"Students are very nervous and worried about the tests. They tend to shut down 

and not tiy their best." « 

"The teachers will cheat so that their studcu!?: will score well, then go around 

bragging about it later. It creates a lot of animosity because eveiyone thinks 

that everyone else is cheating." 

"Stress on teachers and students to be successful in a testing situation that shuts 
down the normal routine for children and teachers." 
"Anxiety among the children; reduction in teacher choice; de- 
professionalization of teaching field." 
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NefAtiveorunftiireoniptrisoni: (N«7) 

"Our itandardized test scores are typed up and passed around to eveiy teacher 
to compare how they did to others, no mention goes to the teachers with low 
classes. It is much too competitive." . ^ 

*^en two rooms had low scores, all third grades suffered the mdignity of 
having a week^ sit-in by a so-called *(Testy helper who did oQShing but sit." 
"I do not feel standardized testt are used correctly-they should be used 
diagnostical^ OOi JudgmenUlly. I should feel free to use them to help students, 
and not have them used against me. The public a press constantly tear schools 
apart because their test scores aren*t as high as another. No one sees the fact 
that we don't all get students who are on the same level. Tm tired of being 
portrayed as an incompetent because of low scores." 
^y children are intermediate bilingual and being compared with standard 
English speaking classes which have quite a head start. This is asking more than 
is reasonable for some of them." 

Inaccurate m$Si.'ore of teachers, students, or curriculumi (N«6) 

"Some of my students don't test well, although they are veiy creative and high 

8cliicvcrs«^ 

"Because of the extreme emphasis placed on these tests, I feel the results are 
not valid." 

"Some people have test anidety. The results are not related to their everyday 
skills. This can track a child and determine future placements." 
"I feel that the tests we take are not testing what we should know at that grade 
level. Too much preparation time is expected." 

Negative influence on slow learners: (Nb3) 

'Timed tests are veiy frustrating to a class of below average, at risk students. 
They need situation: that are successful." 

"The negative influence is reflected in the students* self concept. They compare 
their scores." 

Uadeimines teacher confldence: (N=2) 

"It is discouraging to me and I feel very constricted as a teacher. Vm the 
example: discouraged. I'm applying to another district." 

Other negative responses: (N^^ 1) 




